Figure 1: Predicted protein sequence of mGγ12 (SEQ ID NO:1)

1 MSSKTASTNS IAQARRTVQQ LRLEASIERI KVSKASADLM SYCEEHARSD 
51 PLLMGIPTSE NPFKDKKTCI IL* 




Figure 2: cDNA sequence of mGγ12 variant 1 (SEQ ID NO:2)



1 CTAGAATTCA GCGGCCGCTG AATTCTAGGC GACGACGGCG AAGAGTGAGT
51 GCCAAGGTTC ATATGGGAAG GACTTTGGGG TGAGCATCTT [illegible]
101 AGCTGGCTTT TCTGATTTTC AGAAAGAAGA CTCATCAAAG [illegible]
151 AGACGGCAAG CACCAACAGC ATAGCCCAAG CCAGGAGAAC [illegible]
201 CTGAGATTGG AAGCCTCCAT CGAAAGAATA AAGGTCTCAA [illegible]
251 AGACCTGATG TCATACTGTG AGGAGCATGC CCGGAGCGAC [illegible]
301 TGGGCATACC GACCTCAGAA AACCCGTTCA AGGATAAGAA [illegible]
351 ATCTTATAGT GGACCAGGAA GCGCCCCTTG CCTCTTAACG [illegible]
401 CAGCAACCTG AAGGGATTCC TTCAGCTTAC CTGGTAACCA [illegible]
451 CTAAAACACC CTTCTCTCGG AATAATAGAC [illegible] [illegible]
501 AGTTGTCCTT TCTTCACCCT TTACTGATTT [illegible] [illegible]
551 TTTTCTATTT GATAACTATG GTATCATATT GGGTTACTGT [illegible]
601 TGGCAGGGGA GTTGTGGGAA GCTTGTCTTT ACAAAATATA [illegible]
651 ATATGTCAAG ACCTACATTG TCTAAGCACC GGCAAATTAA [illegible]
701 ATCACTTCAG TCAAAAACCT TTATATTCTG TTCTTAATAA [illegible]
751 AACCTATATC CCATGTAAGG GATCTGGGGA GGAGGCATGT [illegible]
801 ATACCTTTTT GCACTATGGG CACTAACCAC CCTGAAAGTT [illegible]
851 CTCCCTCCCT TCAGAGTTAC ATCATTATCC TGACTCTGTG [illegible]
901 TCCGTGAAAT TTTTGTACAA AAAAAAGGTA ATGAAAGAAC [illegible]
951 TCATCTGCAT TATAATGAGT TGATGCTGTT CTCACTCCTC [illegible]
1001 [illegible] CTTAGTCTAC AATAAACTGT GCCAATTAAA AACCTAAGGC
1051 TAAAACTGAA AGCCCTTTGA TGGGGTCTTA ACTCATATCA GTCATTTGGG
1101 CTTCTCTGAT CCTGAGGCTA AGAAAGGGGA AGAGACCCTC AGGAGGCAGC
1151 TTCCACTCCA GGGCTCTTGA TCTCTGCTGG ATTGGGGGTG GCCACCTCAG
1201 AAACTTCCAC CCTCATGACT GGAATGGAAG AGGGGACCGA GAGCCTCACA
1251 ATCTCGGAGA GGGAGGAGAA ATTCTTAAAA ACAGCTGCTC TCCTGCGCCC
1301 AGCTTCACAG GCAGCCCTGC CCCTTTCTCC TCACCAGCAT GGTACCTGCC
1351 CTTACTGCTA GAGCAGCTGC TTGTAGAGGG ACATTCCCTC CTTCCCAGTT
1401 TTAACTGGTG GACCACAGTG GGGGGAAAAA CATTCAAGCG ATATAAAGAC
1451 ACTTGGGCTC TTTGCAGATG CCTATACTTC CAACACTACC ATGTCCACAA
1501 ACCACCCTGG GGGAGGGCCC TTCCAAAGGG AGGCTTGCTA GTTTCAGCGT
1551 CTAGCAGTTG GGTCCTCACT TTTACTCCAA TTGTGAAAAT AGCCCACGTA
1601 CCCTCGCAGT GTCCAGTAGG GATCCCAGAG GCACATAACC AAGAAAGGAT
1651 TTTGACTTTG TCACAGTGAC TATTTAAAAT AATCTATTCG AAGTCCAAAC
1701 CAAACACAAA GCCTGTGATA TTTTAGGTTA TTAAGGTAAC TGCTAATGAA
1751 GGATTTTAAA AAGTGTCCTC AAAAAGGACT TAGCCCCGGG AGTTGTTTAT
1801 AAAATTTCCC CCACTTGTAT ACAGTGTGCA CTAAAAGAAA ATGTATTTTA
1851 ATATCTAATG CCTGGGCTCT GAGCGTCATG CTTCTTGGTG GTAAACATGC
1901 AGTCCTGTTC CTAAGTGACT CAGAGGCATC AGAATTTCTC CACGTTACCC
1951 ATCTGCTTGG CACTCGGAAC TGAGCGTGTG AAATCCATAG CGCTGCCCAC
2001 AACCTGTTCT CACTGCTTAG CTCCCAGCTG GATTAAAGAC ACCTGCTCAG
2051 GCGGGAGAGA GAGAGAGAGA GCGAGCTTTT ACCTTGGAAA AGGTAAAGAT
2101 GGAAATGTAC ACCAAAAAAG ACAATTTTTA CATTTAATGG AACATTCTTT
2151 TTTTTTACAA GTATATTTTT CTACTGATAG TTTCAGAACA CTAATCTTAT
2201 ATTCACTCTA ATCTTAAACA TGTTTCTTTA AATATTTATA AGGCAGTTTA
2251 TTACAGAATA TTTTCATGCA ATCATGTGCA CATTATTGGT AGCAAACATA
2301 GTATATCCTT TAGTACTTTA GCATATTTTT GTTAAAATAC TTTTAATGGT
2351 AAGAAATGAA CTTGAGGTCC CAGGAGGTTT TGTTGCCTTT TCATTGATTA
2401 GAGACAATAA ATATCTTGTA ACTTCCTAAC CAGATCTGAG CTGTGCTCAC
2451 AATAATAATA ATGAAATCAG ATTCTTTGAT GCTGGACTCA GGAGGGAAAT
2501 CATTAGCCAA CTGTTGACTT ACTTATAGCT AGATGTCTAT GTGAGAAAGT
2551 ATAATATATA TATATACACA TATATATGAC ATGTAAGAGT CACTTTTATT
2601 TATCTGTCTT TGTTCACTTA TGAAGCCGGT AACTGCAGCA GTATGTTGGT
2651 GATGTCATGA TGCACAGAAG TCCCATGTGG AGTGTTTTTC CCACACTGAC
2701 AACTTGGCCT CCTTTCTGTG TGTTCAGTCT GTTGTCTGAA CTAACACTCC
2751 CGCGAGCACT ATACTCTTTA TACTCTGATC CCCCTAGTTC ATCTTAAATT
2801 TGTCTGTGGC CCTGGCAAGA TAGCGTACAC AAGATTCCAT GACTCCAGAG
2851 CATCTTGAAG AAACATACAT ATTTTGAAAG AGGGGAAATG TAGCAGATAG
2901 TTCACAAGCT GCGGGTTGTA GCTAAATATT CCATTTCTTT GAAATCATGT
2951 TTCTAAATTC TTTACCATCA GAAAGAAAAG GAGTGTCATA CACTTTCAAG
3001 GGAAGGCTTG GTCTGCGTTT TCTGTGTTTG GATTATTTTT ATACTTTGCT
3051 GATCTTTAAG CTATCCATGG GGGAAATTTT ATACCAACGA GTTAATAATT
3101 CTCATTCATC GTTTACACAA TGTAACATGT GTCATACTGG GGCCAGCGAG
3151 ATGGCTCAGT AGGTAAAGGT GCTTGATGCT AAGCCCGGCA GCCTGTGTTT
3201 CATCTACAGG ATGCACAACA TAAAAGAAAA GATCTGATTC CCACAGGTTC
3251 TCTTCTGACC TACACACACA CACACTAAAA TAACATTTAA AAATATGTGC
3301 CAAATTATAT TTGTTCGGGT GCCACCTTCC ACCAGCTTAC CACTACGGTA
3351 GAACTGTCAA ATTCATCTCC CTGAATTTGT CTTAAAGGGG TGTCCATGCA
3401 CAGGCCCAAG AGTCACCTCC AATGAAATAA ATGTAATACT GAAGTATGCC
3451 ATGATGTTTG TTGTTTTCTT TCATCGTAAG CCTGTAAGCA GGAAAAATAC
3501 GTCAAATCAG ATAGAATAGA GCATTTACCA GTGGTCGATG GCCTGGTCAG
3551 TCCTGTGCCG GGTGACTTAG GACCAGGCAC GTCAGCTCTC TGAGCCTCCC
3601 CTTCCCTTGT TGTCACAAGG GAATAGAAGC AGAAGAAGCT GAGAGCCTCC
3651 CTATTCCCAG ATGCCCTGGT GGAATGACCT GCCTCTCTGC CGTTTCTGCC
3701 AACGTGTTGG TGCTATAAGC TGCTTCAAAT ACCAGTTTGT CTGTAGTGTG
3751 TACTCACCTA ATCACTTGTT ATCCAGTGCC TGTTCTAGGT TTATGGACTT
3801 AACTATTTCT GTGATGTTTC ATTTTTAGCC ATGTTAACTC CTAACACATA
3851 TTCTCTTATG TCTCAGTAAA GTTTCATTTG ATAAGTTGTT GAGATTCTGT
3901 TATTTGATAA TATTCTTCGG CTGTCCATCC AGCATCTTAA TCACTTTAAA
3951 ACTGTGATTG TTATTTGCAA CTCTGTTCTT TGGAAAGAAT AAAAGCATTT
4001 TTTTTCACTT GCTAACATGC TCACAAATGT GAGAGAAGAG TCATTAAAAG
4051 CTTTACTTAC TGGGTCAGTG CGTCATTGAC TCCTTTCTGT GTTTTGCCCA
4101 ATAAATTAAT AAAAGACCAA AAAAAAAAAA AAAAAAAAAA AAAAAAA




Figure 3: cDNA sequence of mGγ12 variant 2 (SEQ ID NO:3)



1 [illegible] CGGCGGCGAC GACGGCGAAG AGTTCATATG GGAAGGACTT
51 [illegible] ATCTTCTCTA TTTCCAGCTG GCTTTTCTGA TTCACCCCAC
101 CATTTAAAAC CTGGAGGCAC TGGGCCACAC AAAGCCTTGC TGATTTTCAG
151 AAAGAAGACT CATCAAAGAT GTCCAGCAAG ACGGCAAGCA CCAACAGCAT
201 AGCCCAAGCC AGGAGAACTG TGCAGCAGCT GAGATTGGAA GCCTCCATCG
251 AAAGAATAAA GGTCTCAAAA GCATCAGCAG ACCTGATGTC ATACTGTGAG
301 GAGCATGCCC GGAGCGACCC CCTGCTGATG GGCATACCGA CCTCAGAAAA
351 CCCGTTCAAG GATAAGAAGA CCTGCATCAT CTTATAGTGG ACCAGGAAGC
401 GCCCCTTGCC TCTTAACGCA AACCACAGCA GCAACCTGAA GGGATTCCTT
451 CAGCTTACCT GGTAACCACA GCTAGTAACT AAAACACCCT TCTCTCGGAA
501 TAATAGACCC TGAAGTCTCT CTTTTTCAAG TTGTCCTTTC TTCACCCTTT
551 ACTGATTTAA TACAGAATAA CAATCTTATT TTCTATTTGA TAACTATGGT
601 ATCATATTGG GTTACTGTAT AAGGAAAATG GCAGGGGAGT TGTGGGAAGC
651 TTGTCTTTAC AAAATATAAT TGATTAAGAT ATGTCAAGAC CTACATTGTC
701 TAAGCACCGG CAAATTAAAA TGTCGAGAAT CACTTCAGTC AAAAACCTTT
751 ATATTCTGTT CTTAATAATG TTTGTGCCAA CCTATATCCC ATGTAAGGGA
801 TCTGGGGAGG AGGCATGTGT CTACAACCAT ACCTTTTTGC ACTATGGGCA
851 CTAACCACCC TGAAACTTCC TGCGGTAGCT CCCTCCCTTC AGAGTTACAT
901 CATTATCCTG ACTCTGTGTA GGTAAATTTC CGTGAAATTT TTGTACAAAA
951 AAAAGGTAAT GAAAGAACGT TGCAAAGATC ATCTGCATTA TAATGAGTTG
1001 ATGCTGTTCT CACTCCTCTC TTGGAATTGT GCTGGCCCCT TAGTCTACAA
1051 TAAACTGTGC CAATTAAAAA CCTAAGGCTA AAACTGAAAG CCCTTTGATG
1101 GGGTCTTAAC TCATATCAGT CATTTGGGCT TCTCTGATCC TGAGGCTAAG
1151 AAAGGGGAAG AGACCCTCAG GAGGCAGCTT CCACTCCAGG GCTCTTGATC
1201 TCTGCTGGAT TGGGGGTGGC CACCTCAGAA ACTTCCACCC TCATGACTGG
1251 AATGGAAGAG GGGACCGAGA GCCTCACAAT CTCGGAGAGG GAGGAGAAAT
1301 TCTTAAAAAC AGCTGCTCTC CTGCGCCCAG CTTCACAGGC AGCCCTGCCC
1351 CTTTCTCCTC ACCAGCATGG TACCTGCCCT TACTGCTAGA GCAGCTGCTT
1401 GTAGAGGGAC ATTCCCTCCT TCCCAGTTTT AACTGGTGGA CCACAGTGGG
1451 GGGAAAAACA TTCAAGCGAT ATAAAGACAC TTGGGCTCTT TGCAGATGCC
1501 TATACTTCCA ACACTACCAT GTCCACAAAC CACCCTGGGG GAGGGCCCTT
1551 CCAAAGGGAG GCTTGCTAGT TTCAGCGTCT AGCAGTTGGG TCCTCACTTT
1601 TACTCCAATT GTGAAAATAG CCCACGTACC CTCGCAGTGT CCAGTAGGGA
1651 TCCCAGAGGC ACATAACCAA GAAAGGATTT TGACTTTGTC ACAGTGACTA
1701 TTTAAAATAA TCTATTCGAA GTCCAAACCA AACACAAAGC CTGTGATATT
1751 TTAGGTTATT AAGGTAACTG CTAATGAAGG ATTTTAAAAA GTGTCCTCAA
1801 AAAGGACTTA GCCCCGGGAG TTGTTTATAA AATTTCCCCC ACTTGTATAC
1851 AGTGTGCACT AAAAGAAAAT GTATTTTAAT ATCTAATGCC TGGGCTCTGA
1901 GCGTCATGCT TCTTGGTGGT AAACATGCAG TCCTGTTCCT AAGTGACTCA
1951 GAGGCATCAG AATTTCTCCA CGTTACCCAT CTGCTTGGCA CTCGGAACTG
2001 AGCGTGTGAA ATCCATAGCG CTGCCCACAA CCTGTTCTCA CTGCTTAGCT
2051 CCCAGCTGGA TTAAAGACAC CTGCTCAGGC GGGAGAGAGA GAGAGAGAGC
2101 GAGCTTTTAC CTTGGAAAAG GTAAAGATGG AAATGTACAC CAAAAAAGAC
2151 AATTTTTACA TTTAATGGAA CATTCTTTTT TTTTACAAGT ATATTTTTCT
2201 ACTGATAGTT TCAGAACACT AATCTTATAT TCACTCTAAT CTTAAACATG
2251 TTTCTTTAAA TATTTATAAG GCAGTTTATT ACAGAATATT TTCATGCAAT
2301 CATGTGCACA TTATTGGTAG CAAACATAGT ATATCCTTTA GTACTTTAGC
2351 ATATTTTTGT TAAAATACTT TTAATGGTAA GAAATGAACT TGAGGTCCCA
2401 GGAGGTTTTG TTGCCTTTTC ATTGATTAGA GACAATAAAT ATCTTGTAAC
2451 TTCCTAACCA GATCTGAGCT GTGCTCACAA TAATAATAAT GAAATCAGAT
2501 TCTTTGATGC TGGACTCAGG AGGGAAATCA TTAGCCAACT GTTGACTTAC
2551 TTATAGCTAG ATGTCTATGT GAGAAAGTAT AATATATATA TATACACATA
2601 TATATGACAT GTAAGAGTCA CTTTTATTTA TCTGTCTTTG TTCACTTATG
2651 AAGCCGGTAA CTGCAGCAGT ATGTTGGTGA TGTCATGATG CACAGAAGTC
2701 CCATGTGGAG TGTTTTTCCC ACACTGACAA CTTGGCCTCC TTTCTGTGTG
2751 TTCAGTCTGT TGTCTGAACT AACACTCCCG CGAGCACTAT ACTCTTTATA
2801 CTCTGATCCC CCTAGTTCAT CTTAAATTTG TCTGTGGCCC TGGCAAGATA
2851 GCGTACACAA GATTCCATGA CTCCAGAGCA TCTTGAAGAA ACATACATAT
2901 TTTGAAAGAG GGGAAATGTA GCAGATAGTT CACAAGCTGC GGGTTGTAGC
2951 TAAATATTCC ATTTCTTTGA AATCATGTTT CTAAATTCTT TACCATCAGA
3001 AAGAAAAGGA GTGTCATACA CTTTCAAGGG AAGGCTTGGT CTGCGTTTTC
3051 TGTGTTTGGA TTATTTTTAT ACTTTGCTGA TCTTTAAGCT ATCCATGGGG
3101 GAAATTTTAT ACCAACGAGT TAATAATTCT CATTCATCGT TTACACAATG
3151 TAACATGTGT CATACTGGGG CCAGCGAGAT GGCTCAGTAG GTAAAGGTGC
3201 TTGATGCTAA GCCCGGCAGC CTGTGTTTCA TCTACAGGAT GCACAACATA
3251 AAAGAAAAGA TCTGATTCCC ACAGGTTCTC TTCTGACCTA CACACACACA
3301 CACTAAAATA ACATTTAAAA ATATGTGCCA AATTATATTT GTTCGGGTGC
3351 CACCTTCCAC CAGCTTACCA CTACGGTAGA ACTGTCAAAT TCATCTCCCT
3401 GAATTTGTCT TAAAGGGGTG TCCATGCACA GGCCCAAGAG TCACCTCCAA
3451 TGAAATAAAT GTAATACTGA AGTATGCCAT GATGTTTGTT GTTTTCTTTC
3501 ATCGTAAGCC TGTAAGCAGG AAAAATACGT CAAATCAGAT AGAATAGAGC
3551 ATTTACCAGT GGTCGATGGC CTGGTCAGTC CTGTGCCGGG TGACTTAGGA
3601 CCAGGCACGT CAGCTCTCTG AGCCTCCCCT TCCCTTGTTG TCACAAGGGA
3651 ATAGAAGCAG AAGAAGCTGA GAGCCTCCCT ATTCCCAGAT GCCCTGGTGG
3701 AATGACCTGC CTCTCTGCCG TTTCTGCCAA CGTGTTGGTG CTATAAGCTG
3751 CTTCAAATAC CAGTTTGTCT GTAGTGTGTA CTCACCTAAT CACTTGTTAT
3801 CCAGTGCCTG TTCTAGGTTT ATGGACTTAA CTATTTCTGT GATGTTTCAT
3851 TTTTAGCCAT GTTAACTCCT AACACATATT CTCTTATGTC TCAGTAAAGT
3901 TTCATTTGAT AAGTTGTTGA GATTCTGTTA TTTGATAATA TTCTTCGGCT
3951 GTCCATCCAG CATCTTAATC ACTTTAAAAC TGTGATTGTT ATTTGCAACT
4001 CTGTTCTTTG GAAAGAATAA AAGCATTTTT TTTCACTTGC TAACATGCTC
4051 ACAAATGTGA GAGAAGAGTC ATTAAAAGCT TTACTTACTG GGTCAGTGCG
4101 TCATTGACTC CTTTCTGTGT TTTGCCCAAT AAATTAATAA AAGACCAAAA
4151 AAAAAAAAAA AAAAAAAAAA AAAAA



Figure 4: Amino acid sequence of human Gγ12 (SEQ ID NO:4)

1 MSSKTASTNN IAQARRTVQQ LRLEASIERI KVSKASADLM SYCEEHARSD
51 PLLIGIPTSE NPFKDKKTCI IL



